Learning to generate novel views of objects for class recognition
Abstract
Multi-view object class recognition can be achieved using existing approaches for single-view object class recognition, by treating different views as entirely independent classes. This strategy requires a large amount of training data for many viewpoints, which can be costly to obtain. We describe a method for constructing a weak three-dimensional model from as few as two views of an object of the target class, and using that model to transform images of objects from one view to several other views, effectively multiplying their value for class recognition. Our approach can be coupled with any 2D image-based recognition system. We show that automatically transformed images dramatically decrease the data requirements for multi-view object class recognition. © 2009 Elsevier Inc. All rights reserved.
Related articles
Modified CLPSO-based fuzzy classification System: Color Image Segmentation
Fuzzy segmentation is an effective way of segmenting out objects in images containing both random noise and varying illumination. In this paper, a modified method based on the Comprehensive Learning Particle Swarm Optimization (CLPSO) is proposed for pixel classification in HSI color space by selecting a fuzzy classification system with minimum number of fuzzy rules and minimum number of incorr...
Clustering of Learning Images and Generation of Multiple Prototypes for Object Recognition
In this paper, we propose two methods of clustering learning images to generate prototypes automatically for object recognition. One is for clustering views of a single object and the other is for clustering different objects observed in a similar direction which belong to the same object class. In both cases, we first group all learning image...
View-based Models of 3D Object Recognition and Class-specific Invariances
This paper describes the main features of a view-based model of object recognition. The model tries to capture general properties to be expected in a biological architecture for object recognition. The basic module is a regularization network in which each of the hidden units is broadly tuned to a specific view of the object to be recognized. The network output, which may be largely view indepen...
Spatiotemporal information during unsupervised learning enhances viewpoint invariant object recognition.
Recognizing objects is difficult because it requires both linking views of an object that can be different and distinguishing objects with similar appearance. Interestingly, people can learn to recognize objects across views in an unsupervised way, without feedback, just from the natural viewing statistics. However, there is intense debate regarding what information during unsupervised learning...
Learning the 3-D structure of objects from 2-D views depends on shape, not format
Humans can learn to recognize new objects just from observing example views. However, it is unknown what structural information enables this learning. To address this question, we manipulated the amount of structural information given to subjects during unsupervised learning by varying the format of the trained views. We then tested how format affected participants' ability to discriminate simi...
Journal title:
- Computer Vision and Image Understanding
Volume 113, Issue
Pages -
Publication date 2009